Description

Background of the Invention 

This invention relates to materials and methods for the detection of DNA 5-cytosine methyltransferase (5-C-DNA
methyltransferase) and more particularly relates to a novel oligodeoxyribonucleotide (ODN) as a substrate for
selective detection and quantification of mammalian 5-C-DNA methyltransferase.

Regulation of gene expression in eukaryotic organisms involves many different biological mechanisms that operate 
independently of one another. DNA methylation is among these mechanisms associated with the regulation of gene
expression. The enzyme which is responsible for this process, DNA methyltransferase, acts by transferring a methyl
group from S-adenosylmethionine to a cytosine residue in a CG-sequence of DNA to give an altered sequence contain- 
ing 5-methylcytosine. Not all CG sequences in DNA are methylated by the enzyme and not all DNA methyl transfer 
reactions are involved in gene expression, but there is a consistent, well-documented correlation between DNA meth- 
ylation and the inhibition of the activity of many genes. 

Attempts to clarify the role that DNA methylation plays in gene expression have been hampered by the lack of a
specific inhibitor of DNA methyltransferase. In this regard, the compounds 5-azacytidine (azaC) and
5-aza-2'-deoxycytidine (azadC) have been used to study inhibition of DNA methylation. Both azaC and azadC are
metabolized and are ultimately incorporated into cellular DNA. Subsequently, DNA which contains azadC directly
inhibits DNA methyltransferase and thus the overall process of DNA methylation. However, DNA which contains azadC
also interferes with a variety of other DNA-protein interactions, and so the cellular effects of azadC cannot be
ascribed solely to its ability to inhibit DNA methylation. There remains a need for the systematic design and
synthesis of totally specific inhibitors of DNA methyltransferase.

For example, methylation of Cytosine (C) residues in DNA plays an important role in regulating gene expression 
during vertebrate embryonic development. Conversely, disruption of normal patterns of methylation is common in
tumors and occurs early in progression of at least some human cancers. In vertebrates, it appears that the same DNA
methyltransferase (DNA MTase) maintains pre-existing patterns of methylation during DNA replication and carries out 
de novo methylation to create new methylation patterns. There are several indications that inherent signals in DNA 
structure can act in vivo to initiate or block de novo methylation in adjacent DNA regions. 

In vertebrate cells, about 3% of cytosine (C) residues in DNA have a methyl group on carbon 5, and 5-methyl
cytosine (5mC) is the only naturally occurring modified base so far detected in DNA. Enzymatic methylation of
cytosine residues in DNA occurs postreplicatively and primarily involves C residues in CpG dinucleotides, although
methylation has been observed at C residues 5' of other nucleotides. The extent and pattern of methylation of
genomic DNA is species- and tissue-specific, which implies that the pattern of methylation is faithfully inherited
in all cells of common lineage within a tissue. Analysis of methylation patterns of specific genes during
development suggests that patterns established in sperm and oocytes are lost during early development, that regions
other than CpG islands become almost fully methylated, and that loss of methylation occurs at specific sites in
tissues where a gene is expressed.

Although not all genes are regulated by methylation, hypomethylation at specific sites or in specific regions in a
number of genes is correlated with active transcription. DNA methylation in vitro can prevent efficient
transcription of genes in cell-free systems or transient expression of transfected genes; methylation of C residues
in some specific cis-regulatory regions can also block or enhance binding of transcription factors or repressors.
DNA methylation is involved in inactivation of one of the two X chromosomes in female mammalian somatic cells, and
allele-specific methylation has been proposed as a factor in genomic imprinting. The most direct evidence for the
importance of DNA methylation in development is the demonstration that homozygous mutation in murine DNA 5-cytosine
methyltransferase (5-C-MTase) leads to impaired embryonic development.

Conversely, disruption of normal patterns of DNA methylation has been linked to development of cancer. The
5-methylcytidine (5MeC) content of DNA from tumors and tumor-derived cell lines is generally lower than in normal
tissues, although increased methylation of CpG sites occurs in some genes and chromosome regions. While these
observations support the concept that methylation patterns are established in the embryo and altered during
carcinogenesis by a combination of de novo methylation and loss of methylation in a time-, sequence-, and
tissue-specific manner, the mechanism(s) by which these changes occur and are regulated with such apparent precision
has not been defined.

The processes involved in regulating de novo methylation are particularly puzzling. As would be predicted for an 
enzyme that maintains established patterns of methylation during DNA replication, mammalian DNA MTases have a 
much greater capacity for methylating hemimethylated CpG sites in double-stranded (ds) DNA than completely
unmethylated sites. However, since the gene encoding mammalian DNA 5-C-MTase is present as a single copy per haploid
genome and there is no direct evidence for the existence of a separate de novo DNA MTase, it appears that the same
enzyme must carry out both functions. 

It should be pointed out that bacterial 5-C-DNA methyltransferases are not the same as mammalian 5-C-DNA
methyltransferases and do not function in the same way, but they are sufficiently similar in mechanism that, to
date, the presence of bacterial 5-C-DNA methyltransferase can interfere with detection methods for mammalian
5-C-DNA methyltransferase.

In any case, mammalian DNA methyltransferases are involved in gene expression, and the activity of this enzyme
is elevated in the colon mucosa of patients at risk for colon cancer. It has also been shown that lowering the level
and activity of DNA 5-MTase lowers the incidence of colon cancer in transgenic mice that develop this disease
spontaneously. It is therefore important to be able to detect and quantify mammalian 5-C-DNA methyltransferases and
distinguish them from other 5-C-DNA methyltransferases which cause different functional effects.

Brief Description of the Drawings 

Figure 1 shows a generic structure of ODNs having

C G
G C

bonding.

Figures 2 through 9 show a variety of specific ODNs, all of which have

C G
G C

bonding, except for Figure 5 which shows a related ODN loop structure.

Brief Description of the Invention 

In accordance with the present invention, a novel DNA analog has been developed which specifically interacts with
mammalian 5-C-DNA methyltransferase. The DNA analog acts as a substrate for selective detection of mammalian
5-C-DNA methyltransferase even in the presence of bacterial 5-C-DNA methyltransferase. The substrate comprises
oligomeric DNA (ODN) which contains at least one 5-methyl-2'-deoxycytosine (5mC) residue and at least one cytosine
(C) or 5-fluorocytosine (5FC) residue, each of which is followed in linkage by a guanine (G) residue.

The novel substrates of the invention are single-stranded with the potential to form transient looped structures and
preferably contain from 12 to 50 nucleic acid bases and at least two C residues, at least one of which is 5mC and at
least one of which is a C in a nucleotidyl linkage to G (CG). This C may be C or 5FC, and modifications may be made
in the sugar or phosphodiester moieties of the phosphodiester linkage to increase resistance of the ODN to nuclease
digestion.

The design of the DNA analogs of the invention which specifically interact with mammalian DNA methyltransferase
has several distinct advantages. In particular, the effects of these analogs can be specifically ascribed to their
effects on DNA methyltransferase and therefore on the process of DNA methylation in cells. These analogs are useful
as research tools to elucidate the basic mechanistic behavior of mammalian DNA methyltransferases and to provide a
highly sensitive means to measure (or inhibit) DNA methyltransferase activity of tumor cells, whether obtained from
established cell cultures or from clinically obtained tumor tissue samples. Furthermore, the technology exists to
develop these DNA analogs as diagnostic tools for the premalignant state in cancer progression and for following
responses to the treatment of cancers, particularly colon cancer.

Examples of particular embodiments of the invention include the following oligomeric DNA strands. For purposes
of all sequences disclosed herein, to meet nucleotide sequence disclosure requirements R and N may both be repre- 
sented by N. 

5'-ATTGCGNATTNCGGATNRGCGATC-3' (Seq. ID #1)
5'-ATTGCGNATTCCGGATCRGCGATC-3' (Seq. ID #2), and
5'-ATTGNGCATTCCGGATCRGCGATC-3' (Seq. ID #3)

where N is 5-methyl-cytosine and R is either cytosine or 5-fluoro-cytosine.
The related double-stranded substrates such as: 

5'-ATTGNGCATTCNGGATCNGNCATC-3' (Seq. ID #4)
3'-TAARGCGTAAGGRCTAGGRGGTAG-5' (Seq. ID #5)

where N is 5-methyl-cytosine and R is either cytosine or 5-fluoro-cytosine, act as matched control substrates,
since they provide a highly sensitive means to measure (or inhibit) the activity of both mammalian 5-C-DNA
methyltransferase and those bacterial 5-C-DNA methyltransferases that recognize the specific order of residues in
this substrate.
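
As an illustration of the notation used in the sequence listing, the following minimal Python sketch scans Seq. ID #1 for
cytosine-type residues. It assumes only what the listing states (N is 5-methyl-cytosine, R is cytosine or
5-fluoro-cytosine); the function name and the decision to treat C or R followed by G as a candidate substrate site
are illustrative choices, not part of the disclosure.

```python
# Single-letter codes follow the sequence listing:
# N = 5-methyl-cytosine (already methylated), R = cytosine or 5-fluoro-cytosine.
SEQ_ID_1 = "ATTGCGNATTNCGGATNRGCGATC"  # written 5' -> 3'


def cytosine_like_sites(seq):
    """Return (candidate CG sites, 5mC residues) as 1-based positions.

    A candidate CG site is a C or R immediately followed by G.
    A 5mC residue is any N, whatever its neighbor.
    """
    cg_sites = []
    methylated = []
    for i, base in enumerate(seq):
        next_base = seq[i + 1] if i + 1 < len(seq) else ""
        if base == "N":
            methylated.append(i + 1)
        elif base in "CR" and next_base == "G":
            cg_sites.append(i + 1)
    return cg_sites, methylated


cg_sites, methylated = cytosine_like_sites(SEQ_ID_1)
print("candidate CG sites:", cg_sites)
print("5mC residues:", methylated)
```

For Seq. ID #1 this reports CG sites at positions 5, 12, 18 and 20 and 5mC residues at positions 7, 11 and 17, so
none of the 5mC residues in this strand sits in a CG site.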

Detailed Description of the Invention 

"Oligomeric DNA" or ODN, as used herein, means a DNA having from 10 to 75 "purine or pyrimidine bases"
connected in a series of nucleotides which are deoxyribonucleotides or analogs of deoxyribonucleotides with
chemically modified sugars or linkages between sugars. Purine or pyrimidine bases as used herein mean adenine (A),
cytosine (C), guanine (G), thymine (T), and analogs of C (5-methyl-cytosine, 5-fluoro-cytosine, 5-bromo-cytosine,
etc.) that have been chemically modified so as to continue to be chemically acceptable into a synthetic chain of
deoxyribonucleic acid or deoxyribonucleic acid analogs.

"5-C-DNA methyltransferase" means DNA 5-cytosine methyltransferase for methylation of cytosine to form a
5-methyl-2'-deoxycytosine residue in DNA.

To obtain DNA analogs which are highly specific for mammalian DNA methyltransferase, we have artificially
constructed ODN analogs of DNA which contain CG sequences that have the intrinsic potential to be recognized and
methylated by DNA methyltransferases. In general, we have chosen ODN constructs which are 24 bases in length, or
24mers, but our invention can also apply to ODNs that are shorter and/or longer than 24 bases in length, e.g., 10 to
75 and preferably 12 to 50 bases in length. We have examined the activity of these ODNs as substrates for DNA
methyltransferase, and have identified single-stranded as well as double-stranded constructs of DNA which are
excellent substrates of DNA methyltransferase.

Four examples of DNA constructs are shown in Table 1. They are labeled Sense (S), Antisense (AS), Methylated
Antisense 1 (M1) and Methylated Antisense 2 (M2). The mammalian DNA methyltransferase substrate activity of these
4 single-stranded ODNs is shown in Table 2. It can be seen that M2 has exceptional substrate activity, especially in
comparison with S and AS, which were designed to have an optimal number of potentially methylatable CG sites
available. In contrast to its remarkable substrate activity for mammalian DNA methyltransferase, M2 is not
methylated by most bacterial DNA methyltransferases, which require double-stranded DNA as a substrate. Even with
bacterial SssI methyltransferase, which has similar specificity to mammalian DNA methyltransferase in that it
methylates CG and can methylate single-stranded DNA, M2 has only moderate activity as a substrate (see Table 3).
This property of M2 provides an approach to accurately monitoring levels of DNA methyltransferase activity in
clinically obtained tissue samples that might otherwise be contaminated by bacteria, since bacterial DNA
methyltransferase activity would be excluded as a component of this measurement.

Three examples of double-stranded DNA analog constructs are shown in Table 4: S + AS, S + M1, and S + M2.
Their activity as mammalian DNA methyltransferase substrates is also shown in this Table, and it can be seen that
the double-stranded ODN S + M1 has exceptional substrate activity which is comparable to that observed for the
single-stranded ODN M2.

Thus, we have artificially constructed and identified single-stranded ODN and double-stranded ODN, both with
exceptional activity as substrates of mammalian DNA methyltransferase. To our knowledge, the composition (i.e. the
exact base sequence) of these synthetic substrates is unique. In addition, to our knowledge, there are no other
reports of methylated single-stranded DNAs serving as exceptional substrates for mammalian DNA methyltransferase
and, in particular, no previous example of substrate activity in oligomers with 5mC residues in sites other than CG
sites. Since methylation occurs at a CG site distant from the 5mC residues, this substrate is undergoing de novo
methylation.

It is known to those skilled in the art that double-stranded DNAs, including double-stranded ODNs with multiple
5mCGs in one strand base-paired with CGs in the complementary strand, are substrates for mammalian DNA
methyltransferases, and we do not consider this aspect of our invention proprietary. The characteristics of the
double-stranded ODNs described here that are considered unique are 1) their sequence, which is unique in that at
least one ODN of the double-stranded ODN is an excellent substrate for mammalian but not bacterial
methyltransferases, 2) their design, which renders them substrates for both mammalian and specific bacterial DNA
methyltransferases when present in the double-stranded form, and 3) their extremely high substrate activity for
mammalian 5-C-methyltransferases. These double-stranded ODNs also serve as unique controls for the single-stranded
ODNs that are the invention in that they will detect both mammalian and other methyltransferases present in test
samples, while the matched single-stranded ODN will detect only the mammalian enzyme.

In order to obtain highly potent inhibitors of mammalian DNA methyltransferase, we have selected ODNs with
exceptional activity as substrates of DNA methyltransferase and have modified them by systematically replacing the C
of each CG sequence with 5-fluoro-cytosine (FC), an analog of cytosine which cannot be methylated and which,
furthermore, can react irreversibly with the enzyme in a manner which leads to inhibition of the DNA
methyltransferase. This strategy is illustrated in Table 5, which shows how the double-stranded DNA analog substrate
of DNA methyltransferase, S + M1, was systematically modified by the substitution of FC for C in this ODN to produce
a series of FC-modified ODNs of S + M1 for testing as potential inhibitors of DNA methyltransferase.

Table 6 illustrates how this strategy was applied to another ODN substrate of DNA methyltransferase, which we
have designated as ASM7. The systematic substitution of FC for C in this ODN led to a series of FC-modified ODNs.
From this series, a potent DNA methyltransferase inhibitor, ASM7F18, was identified.

Thus, we have systematically modified substrates of DNA methyltransferase, by substituting FC for C in CG 
sequences, to construct potent inhibitors of DNA methyltransferase. 
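
The systematic substitution described above can be sketched in a few lines of Python. This is only an illustration
under stated assumptions: the example strand is Seq. ID #1 with R written as unmodified C (the sequences of S + M1
and ASM7 appear in tables not reproduced here), the letter "F" is a hypothetical code for 5-fluoro-cytosine, and the
function names are invented for this sketch.

```python
# Illustrative strand: Seq. ID #1 with R written as unmodified C.
# N = 5-methyl-cytosine; "F" is a hypothetical code for 5-fluoro-cytosine.
EXAMPLE_ODN = "ATTGCGNATTNCGGATNCGCGATC"  # 5' -> 3'


def cg_positions(seq):
    """1-based positions of every C that is immediately followed by G."""
    return [i + 1 for i in range(len(seq) - 1) if seq[i] == "C" and seq[i + 1] == "G"]


def fc_scan(seq):
    """Map each CG position to a variant with that single C replaced by F."""
    variants = {}
    for pos in cg_positions(seq):
        variants[pos] = seq[:pos - 1] + "F" + seq[pos:]
    return variants


for pos, variant in fc_scan(EXAMPLE_ODN).items():
    print(f"F{pos}: 5'-{variant}-3'")
```

Each variant carries FC at exactly one CG site, so testing the series one member at a time indicates which site the
enzyme actually uses.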
A unique aspect of our invention is that the ODNs we have developed will provide a means to measure active
enzyme in complex mixtures of proteins typical of cells and cell extracts. This contrasts to the use of antibodies for 
detecting enzyme, a method which measures both active and inactive enzyme, and cannot distinguish the active com- 
ponent. This contrasts also to methods which measure mRNA content which cannot be used to quantitate the amount 
of active enzyme protein. 

ODNs of the invention which we have prepared contain 5-methyl-cytosine (5-meC) in addition to the standard
bases (A, T, G, C). These ODNs can be prepared routinely on a DNA synthesizer.

Some ODNs we have prepared for purposes of comparison contain only standard bases (A, T, G, C) and can be
prepared routinely on a DNA synthesizer.

Some ODNs we have prepared contain 5-fluoro-cytosine (5FC) in addition to the standard bases (A, T, G, C) and
5-meC. We have developed a simple and convenient method for incorporating FC into ODNs and have published these
procedures, which we do not consider proprietary and which are now well known to those skilled in the art.

The substrates of the invention have numerous utilities and may, for example, be used as:

1. substrates for evaluating levels of enzyme in experimental systems either by direct measurement of enzyme
activity or by quantitating covalent complex formation with ODNs containing FC;

2. reagents for specifically inhibiting DNA methyltransferase, as opposed to other methyltransferases, in crude cell
extracts or in cells in culture or in animal models;

3. reagents for detecting the presence of as yet uncharacterized CG methyltransferases and for determining
sequence specificity of these enzymes or for determining which sites in known sequences are methylated (i.e., by
inserting FdC in different positions);

4. reagents for isolating and purifying DNA methyltransferases from cells or tissues. Biotinylated or tethered ODNs
containing FC would be used for this application;

5. agents for differentiation therapy;

6. diagnostic tools where accurate measurement of DNA methyltransferase activity in clinically obtained tissue
samples can be an indicator of the extent of disease progression in carcinomas or alternatively an indicator of
therapeutic responses; and

7. probes for in situ detection of active DNA methyltransferase in frozen sections. This would be accomplished
using biotinylated ODNs containing FC. Avidin-linked alkaline phosphatase or peroxidase would then be reacted
with the bound biotinylated ODNs to allow use of standard histochemical techniques for identifying cells or areas in
tissues and/or tumors with high and low levels of active DNA methyltransferase.

Single-stranded ODNs of the invention with high specificity for mammalian DNA methyltransferase and their
sequence-related double-stranded ODNs with high substrate activity for both mammalian and bacterial DNA
methyltransferases have broad-based general utility. Certain instances of the utility of these ODNs involve assays
done using whole cells or tissue samples (i.e., colon), and so it is essential that these single-stranded and
double-stranded ODNs (substrates and inhibitors) with high specificity for cell-free preparations of DNA
methyltransferase also demonstrate similar activity as substrates and/or inhibitors of DNA methyltransferase in
whole cells. Consequently, we have undertaken experiments to determine whether these ODNs 1) can enter whole cells
and 2) are active against DNA methyltransferase in whole cells. For these experiments, several ODNs were synthesized
that are end-capped with phosphorothioate linkages, a standard procedure used to increase resistance of ODNs to
degradation by cellular nucleases. The resulting data support the proposition that capping with phosphorothioate
linkages does not adversely affect the ability of oligomer A to inhibit methylation by DNA methyltransferase (using
a standard cell-free enzyme assay) and, in fact, capping appears to stimulate the rate of methylation of oligomer D
(ASM7) by more than 50% (compare activity of D and C) (Table 7). Incorporation of FC in position 18 of oligomer A
inhibits this methylation by 97%, as would be predicted when covalent complexes are formed with DNA
methyltransferase.

Having established that specific phosphorothioate end capped ODNs retain activity against DNA methyltrans- 
ferase, their biological effects were examined in intact Friend cells: the biological effects of oligomer A were compared 
to oligomer B which has no available substrate sites and thus cannot be an inhibitor of DNA methyltransferase, and of 
oligomer D with oligomer C which has no FC. 

A test was conducted to determine if these oligomers affect growth or differentiation of Friend leukemia cells. If
growth effects are due to the toxicity of FC released from oligomers, then oligomer A and oligomer B should have equiv- 
alent effects and oligomer C should have no effect on growth. 

Alternatively, if growth effects are due to an effect on methylation, then oligomer B which has no substrate site
should be inactive and oligomer A which has FC in the primary substrate site should be active. Oligomer C may have a
minor effect if it competes with endogenous DNA for DNA methyltransferase binding in a reversible manner. If, however,
growth effects are due to a generalized effect of high oligomer concentration, then all oligomers, A, B and C, should 
affect growth similarly. 

When cells were treated with 5mM concentrations of oligomers, it was seen that in fact oligomer A is more potent
than B in inhibiting growth and that oligomer C has no appreciable effect on inhibiting growth. These results are
consistent with the idea that the oligomers are taken up by the cells and that growth effects are due in part to an
effect on methylation, since oligomer A is more potent than oligomer B.

The single-stranded ODN we have designated M2 is a "super substrate" for mammalian but not bacterial DNA
methyltransferase. We believe the composition of this and related artificially constructed single-stranded
substrates with loop-forming potential, such as ASMS and ASM7, to be unique. Unlike other known single-stranded
substrates used for in vitro assessment of enzyme activity, these substrates appear to interact with both an
activation and a catalytic site on the enzyme. There is currently a push to monitor levels of DNA methyltransferase
in colon because of evidence from Baylin et al. (Increased Cytosine DNA-Methyltransferase Activity During Colon
Cancer Progression, JNCI 85, 1235, 1993) that the activity of this enzyme is increased in colon mucosa of patients at
high risk for cancer, i.e., 50% of patients with familial polyposis had elevated levels of DNA methyltransferase
(1.4-fold) and patients with colon cancers had 3-fold elevations. Since the assay is carried out using whole
homogenized colon tissue, the possibility for false positives due to bacterial contamination is great. Our M2
substrate would obviate this problem. The M2 substrate and its 5-fluoro-2'-deoxycytidine containing analog and other
functionally related substrates and their use in diagnostic assays for DNA methyltransferase are considered unique.

We have synthesized both single-stranded and double-stranded ODNs substituted with methyl deoxycytosine and
fluoro deoxycytosine that inhibit DNA methylation and are capable of forming covalent complexes with DNA
methyltransferases. The design of ODNs to inhibit two distinct DNA methylation processes, de novo methylation or
maintenance methylation, is unique. These ODNs are useful in determining the substrate specificity and mechanism of
action of enzymes. The use of such ODNs for (a) quantitation of active enzyme by an assay that would be simpler than
those previously developed and would involve filter-binding of radiolabeled or enzyme-linked DNA containing FdC; (b)
isolation of enzymes from different sources by using biotinylated ODNs or ODNs linked to chromatography matrices;
and (c) specific inhibition of de novo enzymes in vivo is unique. Such inhibitors will be of great importance in
development of gene therapy, since many genes are shut off by methylation after uptake into cells. These ODNs could
also be useful in treating specific cancers, since inhibition of DNA methylation has been shown to cause
differentiation of certain types of tumor cells.

The phosphoramidite of 5-fluorodeoxycytidine (5FdC) and unmodified 5-methyl deoxycytosine (5MdC) and
5FC-containing ODNs were synthesized and purified as described in Marasco, C.J. et al., J. Org. Chem. 57, 6363-6365
(1992). 5MdC phosphoramidite is commercially available. ss ODNs were heated to 90°C for 10 min and quickly chilled
on ice immediately prior to assay. The ss ODNs were not substrates for Hpa II or Hha I DNA methyltransferases
(MTases) at 37°C, indicating an absence of stable intermolecular ds regions. For ds ODNs, an equimolar mixture of
complementary ODNs was heated to 90°C for 10 min and slowly cooled to room temperature. These ds ODNs were
susceptible to quantitative cleavage by Hpa II or Hha I when their recognition sites contained no 5MeC.
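
Before annealing, it can be useful to confirm on paper that two ODNs are in fact fully complementary. The following
Python sketch checks Seq. ID #4 against Seq. ID #5 as written in the sequence listing. It assumes that
5-methyl-cytosine (N) and cytosine or 5-fluoro-cytosine (R) pair with guanine as ordinary cytosine does; the function
name and the lookup tables are illustrative only.

```python
# Watson-Crick partners for the standard bases.
PARTNER = {"A": "T", "T": "A", "G": "C", "C": "G"}
# Assumption: modified cytosines pair like cytosine (N = 5mC, R = C or 5FC).
PAIRS_AS = {"N": "C", "R": "C"}

SEQ_ID_4 = "ATTGNGCATTCNGGATCNGNCATC"  # top strand, 5' -> 3'
SEQ_ID_5 = "TAARGCGTAAGGRCTAGGRGGTAG"  # bottom strand, written 3' -> 5' as in the listing


def fully_complementary(top_5to3, bottom_3to5):
    """True if every aligned position forms an A-T or G-C type pair."""
    if len(top_5to3) != len(bottom_3to5):
        return False
    for top, bottom in zip(top_5to3, bottom_3to5):
        top_base = PAIRS_AS.get(top, top)
        bottom_base = PAIRS_AS.get(bottom, bottom)
        if PARTNER[top_base] != bottom_base:
            return False
    return True


print(fully_complementary(SEQ_ID_4, SEQ_ID_5))
```

Because the bottom strand is listed 3' to 5', the two strings are compared position by position without reversing
either one.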

Preparation of Murine DNA 5-C-MTase. The DNA 5-C-MTase used in these studies was the 100,000 x g supernatant
of a 0.3 M NaCl extract of Friend erythroleukemia cell nuclei. All procedures were described in Wainfan et al.,
Cancer Res. 49, 4094-4097 (1989), except for the addition of 5 µg each of antipain dihydrochloride, leupeptin,
chymostatin, and pepstatin (Boehringer Mannheim) to the extraction buffer.

Methylation Assay. Reaction mixtures (50 µl) in 0.1 M imidazole, pH 7.4/20 mM EDTA/0.5 mM dithiothreitol
contained 0.5 µg of each ODN indicated, ~0.6 units of DNA 5-C-MTase (1 unit transfers 1 pmol to A (AS) • A'Mp (M1)
per min; see Table 7), and 2.8 nCi (1 Ci = 37 GBq) of [methyl-3H]AdoMet (AdoMet = S-adenosylmethionine) (8 hM).
Substrate C sites are in excess in the reaction, and methyl transfer is linear for >45 min. For accuracy,
methylation rates <5 pmol/30 min were measured in quadrupled reaction mixtures (200 µl). After incubation for 30 min
at 37°C, ODNs were processed for quantitation of radiolabel as described (20) with 25 µg of salmon sperm DNA added
as carrier prior to perchloric acid precipitation.

ODNs A (AS) and A' (S) (Table 8) were tested for their ability to act as substrates for methylation by murine DNA
5-C-MTase by measuring the initial rate of methyl transfer from [methyl-3H]AdoMet to these ODNs with substrate in
excess. Neither A nor A' was efficiently methylated in the ss form, although A was methylated at almost three times
the rate of A' (Table 8). Since A and A' have the same number and spacing of CpG sites, this indicates that the
density and spacing of CpG sites are not sufficient to establish the rate of methylation. When A and A' were annealed
to form an unmethylated ds ODN substrate, the rate of methylation was no higher than that obtained with A' alone,
even though twice as many sites per mole of substrate were available for methylation.

All assays were performed in duplicate. Values shown are the average incorporation in three assays ± SD and
represent the initial rate of methylation, i.e., incorporation of [3H]CH3 into DNA, during a 30 min incubation
carried out and quantitated as described in Materials and Methods. With hemimethylated substrates (lines 4 and 5),
*10% of available CpG sites were methylated in 30 min. Background incorporation in the absence of substrate was
about 500 dpm. Rate equals the rate of methylation relative to A1, and ND equals not detected.

When either A or A' in the ds ODN contained 5MeC residues in place of all C residues in CpG sites, forming
hemimethylated sites (Mp), the rate of methylation of the unmethylated strand was increased >130-fold relative to
the rate of methylation of A in the ss form (compare line 1 with lines 4 and 5 in Table 8). No methylation of
completely methylated substrate could be detected, and substitution of 5MeC for C in non-CpG sites (Mx) in the ds
ODNs did not stimulate the rate of methylation significantly above that of completely unmethylated substrate
(compare line 3 with lines 7 and 8 in Table 8). Thus, the interaction of murine DNA 5-C-MTase with A and A' in ss
and ds forms does not differ detectably from its interaction with longer ss- and dsDNA substrates; i.e.,
hemimethylated DNA is methylated much more efficiently than completely unmethylated ds- or ssDNA (Bestor, et al.,
Proc. Natl. Acad. Sci. USA 80, 5559-5563 (1983)). The results also demonstrate that 5MeC in non-CpG sites fails to
stimulate methylation of dsDNA, even when potential methylation sites (CpG sites) are no more than one or two base
pairs distant.

In contrast, substitution of a 5MeC residue(s) for a C residue(s) in different sites in ss ODNs had widely varying
effects on the rate of methylation (Table 9; rate in the tables is the rate of methylation relative to A (0.15 ±
0.01 pmol of [methyl-3H]AdoMet per 30 min); ND = not detectable). 5MeC in all CpG sites in either A or A' (ODN A and
A', lines 2 in Table 9) effectively blocked methyl transfer, which is predictable if all methylation occurs at C
residues in CpG sites. However, substitution of 5MeC for all C residues next to other nucleotides had a markedly
different effect on methylation of ss and ds ODNs. The rates of methylation of ss ODNs A and A' increased
dramatically, approaching or surpassing those of hemimethylated ds ODNs (compare lines 4 and 5 in Table 8 with line 3
in Table 9); again A was the better substrate (Table 9, lines 3). Since 5MeC substituted for C in non-CpG sites had
no effect on methylation rates of ds ODNs (Table 8, lines 7 and 8), this suggests that 5MeC in ssDNA can stimulate
methylation of CpG sites in cis.
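
Because the tables report rates relative to ss ODN A, a fold value can be converted back to an approximate absolute
rate. The Python sketch below does this conversion using the baseline of 0.15 pmol per 30 min quoted in the text. It
assumes the same baseline applies wherever a rate is stated relative to A; the function and constant names are
illustrative.

```python
# Baseline quoted in the text: ss ODN A incorporates 0.15 pmol of methyl per 30 min.
BASELINE_PMOL_PER_30_MIN = 0.15


def fold_to_pmol(fold, baseline=BASELINE_PMOL_PER_30_MIN):
    """Convert a rate relative to ss ODN A into pmol of methyl per 30 min."""
    return fold * baseline


# Fold values mentioned in the text: about 12-fold for a single 5MeC at position 18,
# and more than 130-fold for hemimethylated ds ODNs (a lower bound).
for label, fold in [("single 5MeC at position 18", 12), ("hemimethylated ds ODN", 130)]:
    print(f"{label}: {fold_to_pmol(fold):.2f} pmol per 30 min")
```

The second figure is a lower bound, since the text gives the hemimethylated stimulation only as greater than
130-fold.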

To determine which specific 5MeC residues stimulate methylation, derivatives of A (AS) and A' (S) were
synthesized with single substitutions of 5MeC for C. A single 5MeC near the 5' end of ODN A, replacing either the C
residue in position 5 (C5) or position 7 (C7), stimulated methylation as effectively as the three 5MeC residues
replacing C7, C11, and C17 of ODN A (Table 9, compare lines 3-5; ODN A). In contrast, A strands with 5MeC residues
substituted for C residues in other sites were only marginally better substrates than unmethylated A strands. The
exception was a 5MeC in position 18, which increased the rate of methylation of A ≈12-fold (Table 9, line 7; ODN A).
These results confirmed that 5MeC residues in ss ODN substrates do not have to be in a CpG site to activate
methylation. Introduction of 5MeC residues into A' demonstrated that the presence of a 5MeC residue near the 5' end
of an ODN is not in itself sufficient to increase the initial rate of methylation (Table 9, line 4; ODN A') and that
5MeC residues in the middle (lines 5 and 7; ODN A') or at the 3' end (line 9; ODN A') of an ODN can also activate
methylation. Clearly, factors other than density of 5MeC residues or their position relative to the 5' end are
important in determining which 5MeC residues can serve as activators of methylation.

Substitution of C residues with 5FC was used to determine which sites become substrates for enzymatic
methylation in A strands containing 5MeC. It has been shown that both bacterial and mammalian DNA 5-C-MTases form
stable covalent linkages with 5FC residues in DNA during the process of methylation. Under conditions of substrate
excess, this leads to rapid inactivation of the enzymes. We have found that (i) stable covalent complexes between
murine DNA 5-C-MTase and 5FC residues are only formed when 5FC residues are in substrate CpG sites, and (ii) our DNA
MTase extracts contain only one species of protein (~190 kDa) that forms covalent complexes in an AdoMet-dependent
manner with ODNs with 5FC in substrate sites. 5FC substitution for C residues in all CpG sites (C residues at
positions 5, 12, 18, and 20) of AMx completely inhibited methylation, reconfirming that C residues in CpG
dinucleotides are substrates (Table 10, line 3). Single substitutions of 5FC for C5, C1, and C20 had little effect on
the rate of methylation of AMx, whereas substitution of 5FC for C18 almost completely inhibited methylation (Table
10, line 5). The same result was obtained with A ODNs containing a single 5MeC in position 5 or 7; i.e., 5FC in
position 18 completely inhibited methylation while 5FC in position 20 had no effect on the rate of methylation
(Table 10, lines 9, 10, 12, and 13). Thus, 5MeC in position 5 or 7 activates DNA 5-C-MTase to methylate C18 while
failing to activate methylation of a C residue only two bases downstream. Since the distance between C5 and C18 and
between C7 and C20 is the same, this result suggests a very specific relationship between DNA structure and/or
sequence and the recognition of substrate sites in ssDNA that is not strictly related to distance between sites. It
can also be concluded (i) that 5MeC residues can activate
methyiation of both ss- and dsDNA and that in both cases the substrate C residue is in a CpG site; (ii) that in completely 

so dsDNA, 5MeC residues must be located in CpG sites either to block the use of DNA as a substrate or to activate meth- 
yiation (Table 8); and (iii) that in ssDNA, 5MeC residues must be in CpG sites to block methyiation (compare ODN A', 
lines 5 and 10 and ODN A and A\ lines 2 in Table 8) but not to serve as activators of methyiation. 
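
The position numbering used in this discussion can be checked against the sequence of ODN A given in Table 1. The short Python sketch below is illustrative only and is not part of the original disclosure; the function names are hypothetical. It lists the 1-based positions of every C residue in ODN A and of every C residue that lies in a CpG dinucleotide.

```python
# ODN A (AS strand), written 5'->3' as in Table 1.
ODN_A = "ATTGCGCATTCCGGATCCGCGATC"


def c_positions(seq):
    """Return the 1-based positions of all C residues in seq."""
    return [i + 1 for i, base in enumerate(seq) if base == "C"]


def cpg_positions(seq):
    """Return the 1-based positions of C residues immediately followed by G."""
    return [i + 1 for i in range(len(seq) - 1) if seq[i:i + 2] == "CG"]


all_c = c_positions(ODN_A)
cpg_c = cpg_positions(ODN_A)
non_cpg_c = [p for p in all_c if p not in cpg_c]

print("C residues:        ", all_c)
print("C in CpG sites:    ", cpg_c)
print("C in non-CpG sites:", non_cpg_c)
```

The CpG positions found this way (5, 12, 18, and 20) are the sites substituted in M1, and positions 7, 11, and 17 are among the non-CpG C residues, matching the substitutions in M2.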

Smith et al. (J. Mol. Biol. 17, 39-51 [1991] and Biochemistry 31, 850-854 [1992]) have previously noted that in
dsDNA substrates or in ssDNAs where the substrate CpG site is present in the ds stem of a long-stemmed stem-loop
structure, the substrate C residue for mammalian DNA 5-C-MTase must be in a CpG site but need not be base paired
to a G residue in the complementary strand. A search for similar recognition sites in potential inter- and intramolecular
hydrogen-bonded structures that can be formed by ODN A (Figure 1) was made using the STEMLOOP function of the
Genetics Computer Group package (Version 7.3.1-UNIX; September 1993) and the self-complementarity function of
OLIGO (Version 3.4; National Biosciences, Hamel, MN). The structures generated by these programs indicated that
ODN A could form a homodimer with 32 hydrogen bonds and a variety of stem-loop structures. There are six potential
stem loops stabilized by at least 3 adjacent base pairs. Those with more than 9 hydrogen bonds or those with C7
positioned to direct methylation at C18 or C20 are shown in the drawings. Neither 5MeC in position 5 nor 5MeC in
position 7 is in a base-paired region in the homodimer non-loop structure or in the stem-loop structures (structures 3
and 4, Figures 4 and 5). The structure that contains a 10-bp-long stem stabilized by 19 hydrogen bonds (structure 2,
Figure 3) has a base-paired CpG site in the stem. When 5MeC is substituted for C5 in this site, a hemimethylated
recognition site for DNA 5-C-MTase is formed with C18. No methylation of the C20 in structure 2 would be predicted,
since C20G21 is paired with T3G4 and, thus, is not in a recognition site. Theoretically, a 5MeC in position 18 in
structure 2 or 3 (Figures 3 and 4) could activate methylation at C5 or C12, respectively. In this regard, it is of
interest that 5MeC in position 18 does activate methylation of A, although with less efficiency than 5MeC in position
5 or 7. Structures 1-4 do not, however, explain how 5MeC in position 7 can activate methylation at C18. Only one
structure (5), shown in Figure 6, has a stem-loop with C18 positioned in a substrate recognition site containing 5MeC
in position 7. It has a 3-bp-long stem, with only 9 hydrogen bonds, and has the substrate C18 in a non-base-paired
position. X-ray diffraction studies of the structure of DNA in the active site of a bacterial DNA 5-C-MTase (Hha I)
have recently been reported (Klimasauskas et al., Cell 76, 357-369 (1994)). They demonstrate that hydrogen bonds
between the substrate C residue and the G residue in the complementary strand are broken; the C residue is swung out
of the helix, allowing methylation to occur. This may explain why substrate C residues that cannot be base paired are
particularly good methyl acceptors, since their rotation out of the helix requires less energy than rotation of
normally hydrogen-bonded C residues. In this regard, it should be noted that 5MeC in position 5 in structure 5 could
potentially direct methylation at C20. However, no catalytic interaction was detected between murine DNA 5-C-MTase
and A with 5MeC in position 5 and 5FC in position 20 (Table 10, line 10). This suggests a preferential binding of the
MTase to the "mismatched" recognition site formed by hydrogen bonding between 5MeC in position 7 and G19 over the
fully base paired recognition site formed by hydrogen bonding between 5MeC5G6 and C20G21.
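
The hydrogen-bond counts quoted for the predicted structures can be illustrated with a simple tally. The Python sketch below is a simplification under stated assumptions, not the scoring used by STEMLOOP or OLIGO: it assumes 3 hydrogen bonds per G·C pair and 2 per A·T pair, and scores mismatches and non-Watson-Crick pairs as zero. The function name is hypothetical. Under this scoring, a 3-bp stem reaches 9 hydrogen bonds only when all three pairs are G·C.

```python
# Assumed scoring: G.C pairs contribute 3 hydrogen bonds, A.T pairs contribute 2.
H_BONDS = {("G", "C"): 3, ("C", "G"): 3, ("A", "T"): 2, ("T", "A"): 2}


def stem_hydrogen_bonds(strand_5to3, partner_3to5):
    """Sum hydrogen bonds over aligned positions of two antiparallel strands.

    strand_5to3 is read 5'->3' and partner_3to5 is read 3'->5', so that
    position i of one strand faces position i of the other.
    Mismatched positions contribute 0.
    """
    if len(strand_5to3) != len(partner_3to5):
        raise ValueError("aligned strands must have equal length")
    return sum(H_BONDS.get(pair, 0) for pair in zip(strand_5to3, partner_3to5))


AS_5TO3 = "ATTGCGCATTCCGGATCCGCGATC"  # ODN A, 5'->3' (Table 1)
S_3TO5 = "TAACGCGTAAGGCCTAGGCGCTAG"   # ODN A', 3'->5' (Table 1)

print(stem_hydrogen_bonds("GCG", "CGC"))     # three G.C pairs
print(stem_hydrogen_bonds(AS_5TO3, S_3TO5))  # full 24-bp A.A' duplex
```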

Although we do not have direct evidence for formation of these looped structures and cannot rule out the possibility
that the computer algorithms used failed to detect some potential substrate sites with non-Watson-Crick pairing, the
structures discussed allowed design of additional ODNs to test the hypothesis that formation of a loop with a
mismatched recognition site is necessary for 5MeC in position 7 to direct methylation at C18. Two ODNs with
approximately the same base composition, a 5MeC in position 7, and the same number and spacing of CpG sites as ODN A
were synthesized. One ODN (structure 6, Figure 7) forms a stem-loop of the same size with the same relationship
between C7 and C18 as A (structure 5, Figure 6). It can also form a homodimer that is stabilized by 12 hydrogen bonds
(structure 7, Figure 8) with the substrate C18 in a ds region but not in a hemimethylated site. The other ODN forms a
homodimer with 28 hydrogen bonds (structure 8, Figure 9) and five stem-loop structures stabilized by at least three
adjacent hydrogen-bonded base pairs. C7 is not base-paired in any of these structures. However, it cannot form a
stem-loop structure analogous to structures 5 and 6. This ODN is methylated at the same rate as unmethylated A,
whereas the ODN that can potentially form structure 6 is methylated at >300 times the rate of unmethylated A.

Since little or no methylation of CpG sites (including C18) occurs when they are in an unmethylated ds ODN (A·A')
or a ds ODN containing 5MeC in non-CpG sites including C7 (Table 8, lines 3, 7, and 8), it can be concluded from
the data presented here that a stem-loop structure may be both necessary and sufficient to allow 5MeC in position 7 to
direct methylation at a site 12 bases downstream. However, it is unlikely that such a stem-loop structure, containing
only nine hydrogen bonds, would exist in solution at 37°C unless it is stabilized by its interaction with DNA
5-C-MTase, perhaps with the aid of other nuclear proteins present in our extracts.

The model proposed is consistent with our results. This model posits that the active site in mammalian DNA 5-C-MTase
contains both a regulatory region and a catalytic site. The regulatory region limits the rate of methyl transfer at
the catalytic site. Interaction between 5MeC and the regulatory region relieves this inhibition, leading to an
increased rate of methylation of a substrate C residue. In dsDNA, activation occurs primarily at hemimethylated
recognition sites in which the substrate C need not be hydrogen bonded to a G in the complementary strand (Smith et
al., Biochemistry 31, 850-854 (1992)). Our results indicate that 5MeC in a looped ssDNA can also activate methylation
of a substrate C residue in instances that allow DNA to form a structure in the active site analogous to the
recognition sites for DNA 5-C-MTase in dsDNA (structures 2, 5, and 6). Based on this model, our results would further
suggest that a short ds region of 3 bp including a base pair between the 5MeC residue and the G residue in the
substrate CpG site is sufficient to activate methylation but only when the C to be methylated is not hydrogen bonded
to a G.

If one assumes that ss regions in larger DNA molecules can form similar looped structures in the active site of DNA
MTase, it is evident that this mechanism could account for methylation of CpG sites at some distance from an
established methylation site. Methylation would occur in cis through formation of recognition sites in stem-loops and
might also occur in trans when ss loops from different DNA molecules or regions are brought close enough in the
nucleus to form recognition sites. Single-stranded regions in DNA occur during the course of normal DNA replication
and repair and may also be available as a result of "melting out" of DNA regions through protein binding or through
formation of cruciform structures. When a ss region is converted back to dsDNA, through reannealing with its
complement or through replication, a hemimethylated recognition site is formed, which is then a substrate for
maintenance methylation. While the proposed mechanism is supported by observations that de novo methylation of
integrated viral genomes or repeat elements can spread from a founder site, further studies will be required to
confirm that 5MeC in ss regions of DNA can actually activate methylation at distant sites in DNA of living cells. It
will also be of interest to examine the possibility that the small percentage of 5MeC residues that are not in CpG
sites in mammalian DNA influence the extent of methylation at adjacent CpG sites and to determine whether these 5MeC
residues indicate the existence of a directive mechanism for de novo methylation mediated by additional DNA MTases or
simply random errors introduced by normal DNA 5-C-MTase with relaxed specificity due to posttranslational
modification or partial proteolytic degradation.

In summary, our results provide evidence for a mechanism whereby the single DNA 5-C-MTase found in mammalian cells
can be as active in directing de novo methylation as it is in maintaining established patterns of methylation.
They also suggest a rationale for hypothesizing that the specificity required for establishing tissue-specific
patterns of methylation is determined by a combination of inherent factors that include the ability of particular
sequences to form the required stem-loop structures in ssDNA, the lifetime of the single-stranded state, and the
availability of proteins to stabilize or destabilize particular looped structures.



TABLE 1

DNA ANALOG CONSTRUCTS

SENSE (S):             3' TAACGCGTAAGGCCTAGGCGCTAG 5'    Seq. ID #6
ANTISENSE (AS):        5' ATTGCGCATTCCGGATCCGCGATC 3'    Seq. ID #7
METHYLATED AS (M1):    5' ATTGNGCATTCNGGATCNGNGATC 3'    Seq. ID #8
METHYLATED AS (M2):    5' ATTGCGNATTNCGGATNCGCGATC 3'    Seq. ID #9

N = 5-METHYLCYTOSINE

Underlined sites function as substrate sites for the indicated bacterial DNA methyltransferase
but only in double stranded ODNs





TABLE 2

DNA METHYLASE SUBSTRATE ACTIVITY OF DNA ANALOG CONSTRUCTS

DNA ANALOG      ACTIVITY (CPM)
BACKGROUND      2691
S               4843
AS              12341
M1              5487
M2              882284

RESULTS ARE AVERAGE OF 2 EXPTS. ENZYME PREPARATION FROM FRIEND LEUKEMIA CELLS. ASSAYED WITH 1 µG OLIGOMER



TABLE 3

DNA METHYLASE SUBSTRATE ACTIVITY OF DNA ANALOG CONSTRUCTS

DNA ANALOG      ACTIVITY (CPM)
BACKGROUND      1351
S               175225
AS              49282
M1              2339
M2              51047

RESULTS ARE AVERAGE OF 2 EXPTS. BACTERIAL SSSI ENZYME PREPARATION. ASSAYED WITH 0.25 µG OLIGOMER.

TABLE 4

DNA METHYLASE (FL) SUBSTRATE ACTIVITY OF DOUBLE STRANDED DNA ANALOG CONSTRUCTS

DNA ANALOG      ACTIVITY (CPM)
BACKGROUND      ion
S               1598
AS              1936
M1              1373
M2              157027
S + AS          2623
S + M1          125686
S + M2          9215

RESULTS ARE AVERAGE OF 2 EXPTS.

DOUBLE STRANDED DNA ANALOG CONSTRUCTS:

S     5' GATCGCGGATCCGGAATGCGCAAT 3'    Seq. ID # 6
AS    3' CTAGCGCCTAGGCCTTACGCGTTA 5'    Seq. ID # 7

S     5' GATCGCGGATCCGGAATGCGCAAT 3'    Seq. ID # 6
M1    3' CTAGNGNCTAGGNCTTACGNGTTA 5'    Seq. ID # 8

S     5' GATCGCGGATCCGGAATGCGCAAT 3'    Seq. ID # 6
M2    3' CTAGCGCNTAGGCNTTANGCGTTA 5'    Seq. ID # 9

N = 5-methylcytosine
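
The strand orientations used in Tables 1 and 4 can be cross-checked with a few lines of Python. This is an illustrative sketch only; the helper name is hypothetical, and N (5-methylcytosine) is simply carried through as a placeholder character when a strand is reversed. It confirms that S, written 5'->3' as in Table 4, is the reverse complement of AS, and that the 3'->5' forms of M1 and M2 in Table 4 are the 5'->3' sequences of Table 1 read backwards.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an unmodified DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


# 5'->3' sequences from Table 1 (N = 5-methylcytosine).
AS = "ATTGCGCATTCCGGATCCGCGATC"
M1 = "ATTGNGCATTCNGGATCNGNGATC"
M2 = "ATTGCGNATTNCGGATNCGCGATC"

# Sequences as printed in Table 4.
S_5TO3 = "GATCGCGGATCCGGAATGCGCAAT"
M1_3TO5 = "CTAGNGNCTAGGNCTTACGNGTTA"
M2_3TO5 = "CTAGCGCNTAGGCNTTANGCGTTA"

print(reverse_complement(AS) == S_5TO3)  # S pairs with AS along its full length
print(M1[::-1] == M1_3TO5)               # same strand, opposite reading direction
print(M2[::-1] == M2_3TO5)
```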



EP0 756 008 A2 






TABLE 5

DOUBLE STRANDED DNA ANALOGS BASED ON S + M1 AS POTENTIAL INHIBITORS OF DNA METHYLASE (FL)

S + M1                                   ACTIVITY (cpm)

S     5' GATCGCGGATCCGGAATGCGCAAT 3'     140,000           Seq. ID # 6
M1    3' CTAGNGNCTAGGNCTTACGNGTTA 5'                       Seq. ID # 7

5-FLUOROCYTOSINE CONTAINING ANALOGS OF S + M1

      5' GATFGCGGATCCGGAATGCGCAAT 3'     ND                Seq. ID #10
      3' CTAGNGNCTAGGNCTTACGNGTTA 5'                       Seq. ID # 7

      5' GATCGFGGATCCGGAATGCGCAAT 3'     ND                Seq. ID #11
      3' CTAGNGNCTAGGNCTTACGNGTTA 5'                       Seq. ID # 7

      5' GATCGCGGATFCGGAATGCGCAAT 3'     140,000           Seq. ID #12
      3' CTAGNGNCTAGGNCTTACGNGTTA 5'                       Seq. ID # 7

      5' GATCGCGGATCFGGAATGCGCAAT 3'     28,000            Seq. ID #13
      3' CTAGNGNCTAGGNCTTACGNGTTA 5'                       Seq. ID # 7

      5' GATCGCGGATCCGGAATGFGCAAT 3'     36,000            Seq. ID #14
      3' CTAGNGNCTAGGNCTTACGNGTTA 5'                       Seq. ID # 7

F = 5-FLUOROCYTOSINE
N = 5-METHYLCYTOSINE



TABLE 6

SINGLE-STRANDED ODN                           RELATIVE ACTIVITY*

AS         5' ATTGCGCATTCCGGATCCGCGATC 3'     1                     Seq. ID # 7
ASM7       5' ATTGCGNATTCCGGATCCGCGATC 3'     170                   Seq. ID #15
ASM7F18    5' ATTGCGNATTCCGGATCRGCGATC 3'     3                     Seq. ID #2

N = 5-METHYLCYTOSINE

* Rate of methylation relative to an equal amount of AS
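
The effect of the 5FC substitution at position 18 can be expressed as a percent loss of methyl-accepting activity. The Python sketch below is illustrative only; the function name is hypothetical, and the input values are the relative activities listed in Table 6 (ASM7 versus ASM7F18) and the corresponding rates listed in Table 10 (lines 11 and 12).

```python
def percent_inhibition(rate_without_5fc, rate_with_5fc):
    """Percent loss of methylation rate caused by a 5FC substitution."""
    if rate_without_5fc <= 0:
        raise ValueError("reference rate must be positive")
    return 100.0 * (1.0 - rate_with_5fc / rate_without_5fc)


# Relative activities from Table 6: ASM7 = 170, ASM7F18 = 3.
table6 = percent_inhibition(170, 3)

# Relative rates from Table 10: ASM7 = 130 (line 11), ASM7F18 = 3.2 (line 12).
table10 = percent_inhibition(130, 3.2)

print(f"Table 6:  {table6:.1f}% inhibition")
print(f"Table 10: {table10:.1f}% inhibition")
```

Both data sets give a loss of roughly 98% of the activity seen with ASM7, in line with the statement that 5FC in position 18 almost completely inhibits methylation.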

TABLE 7

ACTIVITY OF PHOSPHOROTHIOATE END CAPPED OLIGOMERS AS SUBSTRATES OR INHIBITORS OF DNA METHYLTRANSFERASE

Oligomer               Structure                    Rate of methylation
                                                    (cpm/µg oligo/30 min)

A* (capped ASM7F18)    ATTGCGNATTCCGGATCFGCGATC     3,310                    Seq. ID #16
B**                    ATGGGATCCCATGGGTTNCCFATC                              Seq. ID #17
C (capped ASM7)        ATTGCGNATTCCGGATCCGCGATC     114,760                  Seq. ID #18
D (ASM7)               ATTGCGNATTCCGGATCCGCGATC     72,010                   Seq. ID #19

N = 5-methylcytosine
F = 5-fluorocytosine

* Proposed active oligomer
** same base composition but no methylatable CG site



TABLE 8

Comparison of the rates of methylation of ss and ds ODNs by murine DNA C-5-MTase

ODN          Initial rate, pmol/30 min    Relative rate

Unmethylated substrates (de novo methylation)
A            0.14 ± 0.01                  1
A'           0.05 ± 0.01                  0.35
A·A'         0.05 ± 0.01                  0.35

Hemimethylated substrates (maintenance methylation)
A·A'Mp       18.6 ± 2.61                  133
AMp·A'       19.2 ± 1.2                   138

AMp·A'Mp     ND
A·A'Mx       0.05 ± 0.005                 0.35
AMx·A'       0.09 ± 0.01                  0.64

A is 5'-ATTGCGCATTCCGGATCCGCGATC-3' = AS = Antisense; A' is 3'-TAACGCGTAAGGCCTAGGCGCTAG-5' = S = Sense.
Mp and Mx indicate 5MeC in place of all boldfaced or underlined C residues, respectively.
A·A'Mp = AS + M3; AMp·A' = M1 + S; AMx·A' = M2 + S; A·A'Mx = AS + M4
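
The relative rates in Table 8 appear to be the initial rates normalized to that of unmethylated ss ODN A. The Python sketch below illustrates that reading; it is an assumption about how the column was derived, not a statement from the original text, and the dictionary keys are plain-text stand-ins for the ODN names. Small differences from the tabulated values are expected, since the tabulated initial rates are themselves rounded.

```python
# Initial rates (pmol/30 min) from Table 8; "." stands for the duplex symbol.
initial_rates = {
    "A": 0.14,
    "A'": 0.05,
    "A.A'": 0.05,
    "A.A'Mp": 18.6,
    "AMp.A'": 19.2,
    "A.A'Mx": 0.05,
    "AMx.A'": 0.09,
}

# Relative rates as printed in Table 8.
tabulated = {
    "A": 1,
    "A'": 0.35,
    "A.A'": 0.35,
    "A.A'Mp": 133,
    "AMp.A'": 138,
    "A.A'Mx": 0.35,
    "AMx.A'": 0.64,
}


def relative_rates(rates, reference="A"):
    """Normalize each initial rate to the rate of the reference ODN."""
    ref = rates[reference]
    return {name: rate / ref for name, rate in rates.items()}


computed = relative_rates(initial_rates)
for name, value in computed.items():
    print(f"{name:8s} computed {value:8.2f}   tabulated {tabulated[name]}")
```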




TABLE 9

          LINE    SITES OF 5MeC SUBSTITUTION    RATE

ODN A     (A = AS)
          1.      none                          1
          2.      5,12,18,20                    ND = M1 = AMp
          3.      7,11,17                       130 = M2 = AMx
          4.      5                             175 = ASM5
          5.      7                             140 = ASM7
          6.      1 /                           Q C _ ACM
          7.      18                            12 = ASM18
          8.      20                            2.5 = ASM20

ODN A'    (A' = S)
          1.      none                          0.35
          2.      4,6,12,19                     ND = M3 = SMp
          3.      1 i Ol                        /D = M4 = SMx
          4.      4                             0.4
          5.      6                             42
          6.      11                            1.7
          7.      12                            67
          8.      19                            1
          9.      21                            37
          10.     6,19                          1

TABLE 10

                  LINE    SUBSTITUTION POSITIONS         RATE
                          5mC          5FC

A (AS)            1.      none         none              1
AMx (M2)          2.      7,11,17      none              128
                  3.      7,11,17      5,12,18,20        ND
                  4.      7,11,17      5                 170
                  5.      7,11,17      12                165
                  6.      7,11,17      18                2
                  7.      7,11,17      20                1 Aft
ASM5              8.      5            none              150
                  9.      5            18                . 1
                  10.     5            20                152
ASM7              11.     7            none              130
ASM7F18           12.     7            18                3.2
                  13.     7            20                160

GENERAL INFORMATION:

(i) APPLICANT: SUFRIN, JANICE R. 

CHRISTMAN, JUDITH K. 
MARASCO JR., CANIO J. 
SHEIKHNEJAD, GHOLAMREZA 

(ii) TITLE OF INVENTION : SUBSTRATE FOR DETECTION OF 
MAMMALIAN 5-C-DNA METHYLTRANSFERASE 

(iii) NUMBER OF SEQUENCES: 22 

(iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: DUNN & ASSOCIATES 

(B) STREET: P.O. BOX 96 

(C) CITY: NEWFANE 

(D) STATE: NEW YORK 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) ZIP: 14108 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: DISKETTE , 3.5 INCH. 1.44 MB 

(B) COMPUTER: VICTOR 300 SX/25 

(C) OPERATING SYSTEM: MS-DOS VERSION 5.0 

(D) SOFTWARE: WORDSTAR PROFESSIONAL RELEASE 4 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/508,778 

(B) FILING DATE: 28-JUL-95 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: DUNN, MICHAEL L. 

(B) REGISTRATION NUMBER: 25,330 

(C) REFERENCE/DOCKET NUMBER: RPP:142 US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (716) 433-1661 

(B) TELEFAX: (716) 433-1665 

INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE:

(vii) IMMEDIATE SOURCE:

(A) LIBRARY:

(B) CLONE

(viii) POSITION IN GENOME:

(A) CHROMOSOME /SEGMENT : 

(B) MAP POSITION: 

(C) UNITS: 
(ix) FEATURE : 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 
(X) PUBLICATION INFORMATION: 

(A) AUTHORS: 
(B) TITLE:

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES : 
(G) DATE:

(H) DOCUMENT NUMBER : 

(I) FILING DATE: 

(J) PUBLICATION DATE: 

(K) RELEVANT RESIDUES IN SEQ ID NO. : 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1 
ATTGCGNATT NCGGATNRGC GATC 24



(3) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : 

(iii) HYPOTHETICAL: 
(iv) ANTI-SENSE:

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE:

(I) ORGANELLE:

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT:

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE:

(A) NAME/KEY:

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 
(X) PUBLICATION INFORMATION: 

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 
(E) ISSUE:

(F) PAGES: 

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE:

(K) RELEVANT RESIDUES IN SEQ ID NO. 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 2 
ATTGCGNATT CCGGATCRGC GATC 24 

(4) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE:

(A) LIBRARY:

(B) CLONE 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT: 

(B) MAP POSITION: 
(C) UNITS:

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 
(x) PUBLICATION INFORMATION:

(A) AUTHORS: 

(B) TITLE:

(C) JOURNAL:

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE: 

(H) DOCUMENT NUMBER:

(I) FILING DATE: 

(J) PUBLICATION DATE: 
(K) RELEVANT RESIDUES IN SEQ ID NO:
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 3
ATTGNGCATT CCGGATCRGC GATC 24 



(5) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 
(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT: 

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME /KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 
(X) PUBLICATION INFORMATION: 

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME : 

(E) ISSUE: 

(F) PAGES:

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE:

(J) PUBLICATION DATE:
(K) RELEVANT RESIDUES IN SEQ ID NO: 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 4 
ATTGNGCATT CNGGATCNGN CATC 24 

(6) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(iii) HYPOTHETICAL: 
(iv) ANTI-SENSE:

(V) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT:

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME /KEY: 

(B) LOCATION:

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 
(X) PUBLICATION INFORMATION: 

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE : 

(H) DOCUMENT NUMBER:

(I) FILING DATE: 
(J) PUBLICATION DATE: 
(K) RELEVANT RESIDUES IN SEQ ID NO: 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 5 

TAARGCGTAA GGRCTAGGRG GTAG 24

(7) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE : 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME /SEGMENT : 

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME /KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 
(K) RELEVANT RESIDUES IN SEQ ID NO 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 6 
TAACGCGTAA GGCCTAGGCG CTAG 24 

(8) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE:

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE : 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE 

(viii) POSITION IN GENOME:

(A) CHROMOSOME/SEGMENT: 

(B) MAP POSITION: 

(C) UNITS: 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 
(X) PUBLICATION INFORMATION: 

(A) AUTHORS: 
(B) TITLE:

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE:

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 
(K) RELEVANT RESIDUES IN SEQ ID NO:
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 7 
ATTGCGCATT CCGGATCCGC GATC 24 

(9) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 
(iii) HYPOTHETICAL:

(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN:

(C) INDIVIDUAL ISOLATE:

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE:

(F) TISSUE TYPE:

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME /SEGMENT: 

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME /KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 
(x) PUBLICATION INFORMATION:

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 
(K) RELEVANT RESIDUES IN SEQ ID NO 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 8 
ATTGNGCATT CNGGATCNGN GATC 24 

(10) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE 

(viii) POSITION IN GENOME:

(A) CHROMOSOME/SEGMENT:

(B) MAP POSITION:

(C) UNITS:

(ix) FEATURE:

(A) NAME/KEY:

(B) LOCATION:

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 
(X) PUBLICATION INFORMATION: 

(A) AUTHORS:

(B) TITLE:

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE : 

(F) PAGES:

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 
(K) RELEVANT RESIDUES IN SEQ ID NO:

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 9 
ATTGCGNATT NCGGATNCGC GATC 24 



(11) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE:

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 
(B) CLONE

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT: 

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME /KEY: 

(B) LOCATION:

(C) IDENTIFICATION METHOD:

(D) OTHER INFORMATION:
(x) PUBLICATION INFORMATION: 

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 
(K) RELEVANT RESIDUES IN SEQ ID NO 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
GATNGCGGAT CCGGAATGCG CAAT 24 

(12) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 
(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT: 

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(A) AUTHORS:

(B) TITLE:

(C) JOURNAL:

(D) VOLUME:

(E) ISSUE:

(F) PAGES:

(G) DATE:

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 
(K) RELEVANT RESIDUES IN SEQ ID NO 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GATCGNGGAT CCGGAATGCG CAAT 24 

(13) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME /SEGMENT : 

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 
(X) PUBLICATION INFORMATION: 

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 

(K) RELEVANT RESIDUES IN SEQ ID NO: 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
GATCGCGGAT NCGGAATGCG CAAT 24

(14) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 



(B) STRAIN:

(C) INDIVIDUAL ISOLATE:

(D) DEVELOPMENTAL STAGE:

(E) HAPLOTYPE:

(F) TISSUE TYPE:

(G) CELL TYPE:

(H) CELL LINE:

(I) ORGANELLE:

(vii) IMMEDIATE SOURCE:

(A) LIBRARY:

(B) CLONE

(viii) POSITION IN GENOME:

(A) CHROMOSOME/SEGMENT:

(B) MAP POSITION:

(C) UNITS:

(ix) FEATURE:

(A) NAME/KEY:

(B) LOCATION:

(C) IDENTIFICATION METHOD:

(D) OTHER INFORMATION:

(x) PUBLICATION INFORMATION:

(A) AUTHORS:

(B) TITLE:

(C) JOURNAL:

(D) VOLUME:

(E) ISSUE:

(F) PAGES:

(G) DATE:

(H) DOCUMENT NUMBER:

(I) FILING DATE:

(J) PUBLICATION DATE:

(K) RELEVANT RESIDUES IN SEQ ID NO:

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 13
GATCGCGGAT CNGGAATGCG CAAT 24

(15) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(iii) HYPOTHETICAL:

(iv) ANTI-SENSE:

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 
(B) CLONE

(viii) POSITION IN GENOME:

(A) CHROMOSOME/SEGMENT:

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME/KEY:

(B) LOCATION:

(C) IDENTIFICATION METHOD:

(D) OTHER INFORMATION:

(x) PUBLICATION INFORMATION:

(A) AUTHORS:

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES:

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 
(J) PUBLICATION DATE: 
(K) RELEVANT RESIDUES IN SEQ ID NO: 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
GATCGCGGAT CCGGAATGNG CAAT 24 




(16) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 
(iii) HYPOTHETICAL:

(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN:

(C) INDIVIDUAL ISOLATE: 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE: 



(F) TISSUE TYPE:

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT:

(B) MAP POSITION:

(C) UNITS:

(ix) FEATURE:

(A) NAME/KEY:

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 
(K) RELEVANT RESIDUES IN SEQ ID NO: 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
ATTGCGNATT CCGGATCCGC GATC 24 

(17) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM:

(B) STRAIN:

(C) INDIVIDUAL ISOLATE: 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE 

(viii) POSITION IN GENOME:

(A) CHROMOSOME/SEGMENT:

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE:

(A) NAME/KEY:

(B) LOCATION:

(C) IDENTIFICATION METHOD:

(D) OTHER INFORMATION:

(x) PUBLICATION INFORMATION:

(A) AUTHORS:

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES:

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 
(K) RELEVANT RESIDUES IN SEQ ID NO:

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 16
ATTGCGNATT CCGGATCNGC GATC 24

(18) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 
(iii) HYPOTHETICAL:

(iv) ANTI-SENSE:

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE:

(A) ORGANISM:

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE:

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE 

(viii) POSITION IN GENOME:

(A) CHROMOSOME/SEGMENT:

(B) MAP POSITION:

(C) UNITS:

(ix) FEATURE:

(A) NAME/KEY:

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION:

(x) PUBLICATION INFORMATION:

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 
(K) RELEVANT RESIDUES IN SEQ ID NO: 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
ATGGGATCCC ATGGGTTNCC NATC 24 

(19) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE:

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT: 

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME/KEY:

(B) LOCATION:

(C) IDENTIFICATION METHOD:

(D) OTHER INFORMATION:

(x) PUBLICATION INFORMATION:

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME:

(E) ISSUE: 

(F) PAGES: 

(G) DATE:

(H) DOCUMENT NUMBER:

(I) FILING DATE: 

(J) PUBLICATION DATE: 
(K) RELEVANT RESIDUES IN SEQ ID NO:
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
ATTGCGNATT CCGGATCCGC GATC 24 

(20) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: 

(D) DEVELOPMENTAL STAGE:

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT: 

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME/KEY:

(B) LOCATION:

(C) IDENTIFICATION METHOD:

(D) OTHER INFORMATION:

(x) PUBLICATION INFORMATION: 

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 
(K) RELEVANT RESIDUES IN SEQ ID NO: 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
ATTGCGNATT CCGGATCCGC GATC 24 

(21) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE:

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT: 

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME/KEY:

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 
(K) RELEVANT RESIDUES IN SEQ ID NO: 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
GATRGRGGAT CRGGAATGRG CAAT 24 

(22) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE:

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE:

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE:

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE 

(viii) POSITION IN GENOME:

(A) CHROMOSOME/SEGMENT:

(B) MAP POSITION:

(C) UNITS:

(ix) FEATURE:

(A) NAME/KEY:

(B) LOCATION:

(C) IDENTIFICATION METHOD:

(D) OTHER INFORMATION:

(x) PUBLICATION INFORMATION:

(A) AUTHORS:

(B) TITLE:

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE:

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 
(K) RELEVANT RESIDUES IN SEQ ID NO: 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
ATGACGCACC TCGTTGACGC GCTA 24

(23) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE:

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: 

(D) DEVELOPMENTAL STAGE:

(E) HAPLOTYPE: 

(F) TISSUE TYPE:

(G) CELL TYPE:

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT:

(B) MAP POSITION:

(C) UNITS:

(ix) FEATURE:

(A) NAME/KEY:

(B) LOCATION: 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 
(x) PUBLICATION INFORMATION: 

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 

(K) RELEVANT RESIDUES IN SEQ ID NO:

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
ATTGCGCTAC TCGGATCCGG CCAT 24 



Claims 

1. A substrate selective for detection of mammalian 5-C-DNA methyltransferase in the presence of bacterial 5-C-DNA
methyltransferase, characterized in that said substrate comprises oligomeric DNA which contains at least one
5-methylcytosine residue, and at least one cytosine or 5-fluorocytosine residue, each of which are followed in linkage
to a guanine residue.

2. The substrate of Claim 1 wherein the oligomer is single stranded or double stranded. 

3. The substrate of Claim 1 or 2 wherein the ODN contains from 12 to 50 bases and at least one cytosine-guanine
linkage where the cytosine may be cytosine, 5-methylcytosine or 5-fluorocytosine and at least one cytosine group is
5-methylcytosine.

4. The substrate of one of the Claims 1 to 3 wherein the substrate also contains at least one 5-fluorocytosine.

5. The oligomeric DNA strand ATTGNGCATTCNGGATCNGGNCATC where N is 5-methylcytosine. 

6. The oligomeric DNA strand GATRGRGGATCRGGAATGRGCAAT where R is cytosine or 5-fluorocytosine.

7. The oligomeric DNA strand ATTGCGNATTNCGGATNRGCGATC characterized in that N is 5-methylcytosine and
where R is cytosine or 5-fluorocytosine.

8. The oligomeric DNA strand ATTGCGNATTCCGGATCRGCGATC and ATTGNGCATTCCGGATCRGCGATC
characterized in that N is 5-methylcytosine and where R is cytosine or 5-fluorocytosine.

9. The substrate of one of the Claims 1 to 4 containing at least one 5-methylcytosine and at least one CG or FG site 
that can form looped structures. 






EP0 756 008 A2 

10. A method for measuring the presence of mammalian 5-C-DNA methyltransferase characterized in that a sample
containing 5-C-DNA methyltransferase is contacted with the substrate of Claim 1, 2, 4 or 5.

11. A method for inhibiting mammalian 5-C-DNA methyltransferase characterized in that a sample containing
5-C-DNA methyltransferase is contacted with the substrate of Claim 1, 2, 4 or 5.
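
The structural criteria recited in Claims 1 and 3 can be illustrated with a short Python sketch. It is an informal reading aid only, not part of the claims and not a construction of them. It assumes the single-letter notation of Claims 5 to 8 (N is 5-methylcytosine, R is cytosine or 5-fluorocytosine), assumes that a plain C denotes unmethylated cytosine, and inspects one strand only. The function names are hypothetical.

```python
# Symbols follow the notation of Claims 5 to 8: N = 5-methylcytosine,
# R = cytosine or 5-fluorocytosine. Treating a plain C as unmethylated
# cytosine is an assumption of this sketch.
METHYLATED = {"N"}
UNMETHYLATED = {"C", "R"}


def cg_sites(seq):
    """Return (index, symbol) for each cytosine-type residue followed by G."""
    seq = seq.replace(" ", "").upper()
    return [
        (i, base)
        for i, (base, nxt) in enumerate(zip(seq, seq[1:]))
        if nxt == "G" and base in METHYLATED | UNMETHYLATED
    ]


def meets_claim_1(seq):
    """At least one methylated and one unmethylated/fluoro C, each followed by G."""
    symbols = {base for _, base in cg_sites(seq)}
    return bool(symbols & METHYLATED) and bool(symbols & UNMETHYLATED)


def meets_claim_3_length(seq):
    """Length window of 12 to 50 bases."""
    return 12 <= len(seq.replace(" ", "")) <= 50


if __name__ == "__main__":
    strand = "ATTGNGCATTCCGGATCRGCGATC"  # second strand recited in Claim 8
    print(cg_sites(strand))
    print(meets_claim_1(strand), meets_claim_3_length(strand))
```

Under these assumptions the second strand of Claim 8 has one N-G site together with C-G and R-G sites, so both checks pass, whereas the strand of Claim 6 carries no N residue and fails the first check when considered on its own.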

1 = C or 5mC; 2 = G or inosine (I); 3 = C, site of methylation;
4 = G, I, O6-methyl G, C, A or an abasic site

FIG. 1



H=32 

5 7 12 18 20 

5'- A TTGCGCATTC CGGATCCGCGATC 
II- I I I I I I I I -|| 
3'-CTA GCGCCTAGGCCT TACGCGTT A-5' 
20 18 12 7 5 

FIG. 2

FIG. 3

FIG. 4

FIG. 5

FIG. 9

(54) Substrate for detection of mammalian 5-C-DNA methyltransferase



(57) A substrate selective for detection of mammalian 5-C-DNA methyltransferase in the presence of bacterial 5-C-DNA
methyltransferase, said substrate comprising oligomeric DNA which contains at least one 5-methylcytosine residue,
and at least one cytosine or 5-fluorocytosine residue, each of which are followed in linkage to a guanine residue.
The invention also includes a method for measuring the presence of mammalian 5-C-DNA methyltransferase which
comprises contacting a sample containing 5-C-DNA methyltransferase with the substrate and also includes a method
for inhibiting mammalian 5-C-DNA methyltransferase which comprises contacting a sample containing 5-C-DNA
methyltransferase with the substrate.